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Abstract 



The degrees of polynomials representing or approximating Boolean functions are a prominent 
tool in various branches of complexity theory. Sherstov SheOSa recently characterized the 
minimal degree deg^if) among all polynomials (over K.) that approximate a symmetric function 
/ : {0, 1}" {0, 1} up to worst-case error e: 



degeif) = e (de5i/3(/) + Vn\og{l/e) 



In this note we show how a tighter version (without the log- factors hidden in the 0-notation) , can 
be derived quite easily using the close connection between polynomials and quantum algorithms. 

1 Introduction 

Boolean functions are one of the primary objects of study in theoretical computer science. Such 
functions can be represented or approximated by polynomials in a number of ways, and the algebraic 
properties of such polynomials (such as their degree) often give information about the complex- 
ity of the function involved. Areas where this approach has been used include circuit complex- 
ity |Raz87l[S5Io87[lBei93j . complexity classes |BRS95[ iB^IMl ITod91| . decision trees |NS941[BW02] . 
communication complexity [BWOll IRazO.Sl ISheOSbl [LSOS] . and learning theory |M()Sn4[[LMMVn5j . 

In this note we focus on polynomials over the field of real numbers. An n-variate multilinear 
polynomial p is a function p : M" — > M that can be written as 

p(xi, ...,Xn)= ^ 05 

SC[n] ies 

for some real numbers as- The degree of p is deg{p) = min{|S'| | as ^ 0}. If is well known (and easy 
to show) that every function / : {0, 1}" — > M has a unique representation as such a polynomial; 
deg{f) is defined as the degree of that polynomial. 

In many applications it suffices if the polynomial is close to / instead of being equal to it: 

Definition 1 The e-approximate degree of f : {0, 1}" ^ M is 

degeif) = inm{deg{p) \ Vx G {0, 1}" : \p{x) - f{x)\ < e}. 
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A function / is called symmetric if its value only depends on the Hamming weight |x| of its 
input X £ {0,1}". Equivalently, /(x) = /(7r(x)) for all x £ {0,1}"" and all permutations vr G 5„. 
We will restrict attention here to symmetric functions /. Examples are OR, AND, PARITY, 
MAJORITY etc. Since the only thing that matters is the Hamming weight |x| of the input, one 
can actually restrict attention to univariate polynomials. We say that a univariate polynomial p 
e- approximates a symmetric function / if |p(|x|) — f{x)\ < e for all x G {0, 1}"". By a technique 
called symmetrization |MP68j . it turns out that for symmetric functions, the minimal degree of 
such univariate ^-approximating polynomials is the same degree degsif) as for n-variate multilinear 
polynomials. Hence we can switch back and forth between these two kinds of polynomials at will. 

Paturi |Pat92] tightly characterized the 1/3-approximate degree degi/^{f) of all symmetric / 
(see the start of Section [2] for the precise statement). Recently, Sherstov |She08a| studied the 
dependence on the error e. He proved the surprisingly clean result that for all e G [2"", 1/3], 

degeif) = e (degy^U) + V™log(l/e)) , 

where the notation hides some logarithmic factors. Note that the statement is false if e <C 2"", 
since clearly deg{f) < n for all /. 

Sherstov gave an interesting application of his result in the context of the inclusion-exclusion 
principle of probability theory. Let / : {0, 1}" — > {0, 1} be a Boolean function. Suppose one has 
events Ai, . . . ,An in some probability space, and one knows the exact values of Pr[nigsj4j] for all 
sets S C [n] of size at most k. How well can we now estimate Pr[/(Ai, . . . , An)]? Sherstov gives 
essentially tight bounds for this for all symmetric functions /, based on his degree-result. This 
generalizes earlier results for the case where / is the OR function, i.e. where one is estimating 
Pr[Uie[„]Ai] [LN90llKLS96| . 

In this note we give a different proof, for a slightly tighter version of Sherstov's degree-result: 

Theorem 1 For every non-constant symmetric function f : {0, 1}"" {0, 1} and e G [2"", 1/3]; 

degeif) = {degi/3{f) + V«log(l/e)) . 

Note that there are no hidden logarithmic factors anymore. As a consequence, the result on 
approximate inclusion-exclusion is sharpened as well, but we won't elaborate on that here. 

The lower bound on degeif) follows immediately from combining Paturi's tight bound for 
degi/sif) with the tight bound on the e-approximate degree of the OR- function proved in |BCWZ99] . 
More interestingly, our upper bound is obtained by exhibiting an efficient e-error quantum algo- 
rithm for computing a symmetric function. It is well known (at least in quantum circles) that the 
acceptance probability of a quantum algorithm that makes T queries to its input can be written 
as an n-variate multilinear polynomial of degree at most 2T [BBC+ni| . The upper bound of The- 
orem [1] actually applies to a larger class of functions, namely all functions that are constant when 
|x| G {t, . . . ,n — t}. These functions may be arbitrary (possibly non-symmetric) for smaller or 
larger Hamming weights. For every such function we have degeif) = 0{y/tn+ log(l/e)). 

Discussion 

The main message of this note is that one can obtain essentially optimal polynomial approxima- 
tions of symmetric Boolean functions by arguing about quantum algorithms. This fits in a line 
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of papers in recent years that prove or reprove theorems about various topics in classical com- 
puter science or mathematics with the help of quantum computational techniques. This includes 
results about locally decodable codes |KW041 IWWOSj , classical proof systems for lattice problems 
inspired by earlier quantum proof systems |AR03l I AR04j . limitations on classical algorithms for 
local search jAarOSj inspired by an earlier quantum proof, a proof that the complexity class PP 
is closed under intersection [AarOSj . lower bounds on the rigidity of Hadamard matrices |Wol06] . 
classical formula size lower bounds from quantum query lower bounds [LLSOSj , and an approach to 
proving lower bounds for classical circuit depth using quantum communication complexity |Ker07] . 

There are advantages as well as disadvantages to our approach in this note. We feel that for 
someone familiar with quantum algorithms and their connection to polynomials, our proof should 
be quite simple and straightforward. Also, our bound applies to a larger class of functions, and is 
tight up to constant instead of logarithmic factors. On the other hand, for those unfamiliar with 
quantum computation our proof is probably not that accessible. Another disadvantage is that we 
do not construct the e-approximating polynomials explicitly (though one may derive them from 
our quantum algorithm), in contrast to Sherstov's construction based on Chebyshev polynomials. 



2 Proof 

Let / : {0, 1}" {0, 1} be a non-constant symmetric function that is constant if the Hamming 
weight \x\ of the input is in the interval {t, .., n — t} (where < i < n/2 is the smallest t for which 
this holds). We know degi/^{f) = @{\^tn) from Paturi jPat92j . In the next two subsections we 
provide matching upper and lower bounds on degsif), thus proving Theorem [TJ 

2.1 Upper bound on degir{f) 



Beals et al. jBBC"'"Ol] showed that the acceptance probability of a T-query quantum algorithm 



on n-bit input is a multilinear n-variate polynomial p : M"" — > M of degree at most 2T. Hence it 
suffices to give an e-error quantum algorithm for / that uses 0{degi/^{f) + \Jn log(l/e)) queries. 
The acceptance probability of the algorithm will be our e-error polynomial. 

Here is the algorithm. It uses various quantum algorithms based on Grover's search algorithm, 
which are explained in the appendix. Let x S {0, 1}" be the input string. The algorithms have 
access to this string via queries. In the quantum case, one query is one application of the unitary 
that maps \i) ^ (— l)^''|i). A solution is an index i £ [n] such that Xj = 1. 

1. Use t repeated applications of exact Grover to try to find up to t solutions (initially assuming 
|x| = t, and "crossing out" in subsequent applications the solutions already found). If |x| < t, 
then with probability 1 these repeated applications find all solutions. This costs 0{\/tn) 
queries. 

2. Use e/2-error Grover to try to find one more solution. This costs 0{\/n log(l/e)) queries. 

3. The same as step 1, but now looking for positions of Os instead of Is. 

4. The same as step 2, but now looking for a instead of a 1. 



The total number of queries is indeed 0(v^+ \Ailog(T7e)). We need to show that this gives error 
probability at most < e for every input x G {0, 1}". Observe the following: 
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• if step 1 found t solutions, then we know > t with probabihty 1 (note that you can verify 
whether a given position is a solution with only 1 extra query). 

• if step 1 found fewer than t solutions, but step 2 found another solution, then we know |x| > t 
(for if |x| < t then step 1 would certainly have found all solutions and there would be none 
left to be found in step 2). 

• if step 1 found fewer than t solutions, but step 2 did not find another solution, then the 
probability that there are more solutions than those found by step 1, is at most e/2 (because 
step 2 ran an e/2-error search algorithm which didn't find any solution). 

• similar observations for steps 3 and 4 (with Os and Is switching roles). 

These observations imply that at the end of the 4 steps we have enough information to compute /. 
Note that with probability at least 1 — e we can distinguish between the three cases |x| < t, 
\x\ £ {t, . . . ,n — t}, and |x| > n — t. If |x| € {t, . . . , n — t} then we are done because / is constant on 
this interval. If \x\ < t then step 1 found all solutions, so we know x completely and can compute 
f{x). If > n — t then step 2 found all non-solutions of x, and again we know x completely. In 
all cases we compute f{x) with error probability at most e. 

This algorithm even works for many non-symmetric functions: it suffices if / is constant on all 
inputs with Hamming weight in {i, . . . , n — t}; / may be arbitrary if \x\ < t or \x\ > n — t since in 
these cases the algorithm actually determines x completely, rather than just its Hamming weight. 

2.2 Lower bound on deg^if) 

We can assume t < n/4, because if t > n/4 then we already have a tight bound from Paturi: 

n > degif) > deg,{f) > degy^{f) = G(n). 

Buhrman et al. |BCWZ99] showed for the n-bit OR function that dege{OKn) = Q{^Jn log(l/e))0 
Since t < n/4, we can embed an OR on at least n — 2t > n/2 bits into / by fixing some of the bits 
to specific values. Hence 

degeif) > max {degii'i{f),Q.{y'n log(l/e))) = Vl (^degif^{f) + v^nlog(l/e)) . 
Acknowledgments 
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A Grover's algorithm and applications 



Grover's quantum algorithm jGro96j for finding a solution (i.e. an i S [n] such that Xi = 1) consists 
of T applications of a certain unitary G, starting from the uniform superposition Yl^=i 1^)- 
won't explain the details of G here. Suffice it to say that each G makes one quantum query, so the 
total number of queries is T. The intuition is that G changes the state by moving amplitude from 
non-solutions to solutions. One can show |BHMT02] that the probability that a measurement of 
the state after T steps gives a solution, is exactly 

(sin((2r+ 1)6*))^ where = arcsin(y^|x|/n). 

If > and T = [(7r/4) Y^n/|x|] , then this probability is close to 1. Hence if we know (at 
least approximately) the number of solutions then we can find one with good probability using 
0{\/ n/\x\) queries. If we know |x| exactly, a small modification of the algorithm finds a solution 
with probability 1 |BHMT02] . This uses exactly [(7r/4)iyn/|x|] queries; we will refer to it as "exact 
Grover" . 

What if we don't know how many solutions there are in the input? We can first apply Grover 
assuming the number of solutions is n/2, then assuming it is n/4 etc. This finds one solution with 
probability at least some constant, even if we don't know the number of solutions. The complexity 
is Yl^i^r 0(y^n/2*) = 0{^/n) queries. If we know there are at least t solutions, this can be improved 
to 0{\/n/t). We will refer to this as "usual Grover". 

And what if we want to have probability at least 1 — e of finding a solution? Buhrman et 
al. [BCWZ99] designed an algorithm that achieves this using 0{\Jn log(l/e)) queries, and showed 
(by proving the lower bound on deg^iOVC) mentioned in Section [2. 2p that this complexity is optimal 
up to a constant factor. Their algorithm is quite simple. Apply exact Grover log(l/e) times, first 
assuming there is 1 solution, then assuming there are 2 solutions, etc. If the actual number of 
solutions is between 1 and log(l/e), at least one solution will have been found with probability 1 by 
now. If no solution has been found yet, then apply usual Grover 0(log(l/e)) many times assuming 
there are at least t = log(l/e) solutions. It is easy to verify that this has overall query complexity 
Oi^sjn log(l/e)) and error probability at most e. We will refer to this as "e-error Grover". 

De Graaf and de Wolf |GW021 Lemma 2] observed that exact Grover can be used to find all 
solutions with probability 1, as long as we know an upper bound t on the number of solutions. 
Suppose we run exact Grover t times: the first time assuming we have exactly t solutions, the 
second time assuming we have exactly t — 1 solutions, etc. Each time we find a solution i, we "cross 
it out" in the sense of modifying the input by setting Xi to (this can easily be achieved by some 
unitary pre- and post-processing around the query). This prevents the algorithm from finding the 
same solution twice. The total number of queries used is 

t 

i=l 

To see that this finds all solutions with probability 1, observe that the assumed number of solutions 
t — i + 1 of the zth run always upper bounds the actual number of remaining solutions (this "loop 
invariant" is easily proved with downward induction). Hence if we start with at most t remaining 
solutions, then after t runs we end with solutions — meaning all solutions have been found. 
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